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Abstract 

A closed-form expression and an upper bound are obtained for the mean 
absolute error of the unbiased estimator of a probability in inverse binomial 
sampling. The results given permit the estimation of an arbitrary probability 
with a prescribed level of the normalized mean absolute error. 
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1 Introduction 

The estimation of a probability from a set of observations is one of the most basic 
problems in statistics, with many applications in signal processing and related fields. 
One commonly used technique is inverse binomial sampling (also termed negative- 
binomial Monte Carlo). Given a sequence of independent Bernoulli trials with 
probability of success p, this technique consists in observing the sequence until a 
given number N of successes is obtained. The resulting number of trials is denoted 
as n. The uniformly minimum variance unbiased estimator of p, for A'^ > 2, is [T] 

. 1 

For A > 3 the mean square error (MSB) of ^ is known to satisfy E[(p — < 
1/(A — 2) irrespective of p [ij. Recent works [2] [3] have shown that, for the modified 
estimator p = (A — l)/n, the confidence level associated to a relative interval of 
the form [p/ fi2,PIJ-i] also satisfies a lower bound irrespective of p, for A > 3 and 
a certain range of /ii,/i2 values. A similar result also holds for the estimator ([1]), 
albeit for a reduced range of /ii, /i2 values [3]. 

This paper analyzes the mean absolute error (MAE) of the estimator ([1]), for 
A > 2. Although the MAE is less often used than the MSE, it is a more natural 
error measure, and has several advantages [S] [B]. It is simpler, it has a clearer 
meaning, and it is less sensitive to outlying values. Apparently, its lack of use is in 
large part motivated by the analytical difficulty associated to the absolute value [5] 
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2 Result 

Let no = [(A^- l)/pj+l and un = 2exp(-7V + l)(iV- l)^-V(iV-2)!. For iV > 2, 
the MAE of the estimator ([T|) satisfies the following. 
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< UN for all p G (0, 1). (4) 
Proof. Given N , the probability function of n, denoted as fN{n) ~ Pr[n — n], is 

= (^l^^)p^(i-pr-^, ^>A^- (5) 

The corresponding distribution function is denoted as Fpf{n). 

The MAE is computed, using the identity fN{'n)/{n—l) — pfN-i{n — l)/{N—l), 

as 



n=7V r!=Af / \ n=N n=N ) 

^ 2p[Fjv-i(no - 1) -FAr(no)]. 

(6) 

Let &n,p(i) denote the binomial probability function with parameters n, p evaluated 
at i. Taking into account that i<jv-i(^io ~ 1) = -Pjv('t-o) + (1 ~ p)bno-i,p{N ~ l)j 
from ([6]) it stems that E(|p — = 2(1 ~ p)bn„-i,p{N — 1), which establishes ([2]). 
The limit result ^ follows from the Poisson theorem ^8^ . 

Let /„ = {{N - l)/no, {N - l)/(no - 1)]. To prove it is noted that all values 
p (z In give no — n. From (|2|. 

omp ~ pm ^ 2 h - - pr^-^iN - 1 - nop), 



dp \N -I 

which is negative for p G /„. Therefore, it is sufficient to consider p restricted to 
the set S ^ {p€ (0, 1) I (iV - l)/p £ N}; and 

iV - 1 

no = hi for p e S". (7) 
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Defining 

.^ilni^^^^yiffi-^, (8) 

V E(|p-p|)/p 

the inequality ([4]) is equivalent to x > 0. It follows from (HI), ([3]), (O and ([8]) that 



1 v-? / ip \ IfN-l \ , , N-l 
-El^ 1-1^ iV + 2 In(l-p) . (9) 
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Equation has the same form as [3J eq. (17)]. However, the result in the cited 
reference which establishes the nonnegativity of x is not applicable, and a separate 
analysis is required. The variable x can be written as X^jlo ^iP^ with 

1 .j+i A^- 1 _ A^- 2 

(j + l)(7V-ip-+i + 2 j + l- ^^^> 

For N — 2, Xj reduces to l/(j + 2), and is thus positive. For > 3, using the 
inequality [3j J2Z~i^ > (iV - 2y+^/{j + 2) in 1^, 

(j + + 2)x, >{N-2)[\- j + J + 3 - TV. (11) 

Let yj denote the right-hand side of (fTTI) . Computing dyj/dj as if j were a contin- 
uous variable and using the inequality ln(l + t) > t/(l + t) gives 

Thus yj > Ho = 1/(TV — 1) > for any j > 1, which implies that all the coefficients 
Xj are positive. Therefore, a; > for > 2. This establishes (jH). 

The positivity of xj for all j > 0, A^ > 2 implies that the values of xp corre- 
sponding to p = (A^ — l)/n, n^A^ — 1,A^, A^-|-l,... form a decreasing sequence. 
Together with the negative character of 9[E(|p — p\)/p]/dp for p e /„q, this implies 
that E(|p — p\)/p is a monotonically decreasing function of p. □ 



3 Discussion 

The result above allows the estimation of a probability p with a prescribed value 
of the normalized MAE, E(|p — p\)/p- This value is guaranteed irrespective of the 
unknown p. For example, if a normalized MAE not exceeding 10% is desired, A^ = 65 
suffices, according to @. 

The behaviour of the normalized MAE as a funcion of p is depicted in Figure [l] 
with solid lines. The discontinuity of the derivative at the points p = [N — l)/n, 
n G N (see proof of theorem) can be clearly observed, specially for low A^ and large 
p. Figure [2] shows the bound (U as a function of N . The bound for the normalized 
root mean square error (RMSE), (E[(p — p)"^])^^"^ /p, is also shown for comparison. 
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Figure 1 : Normalized MAE as a function of p 
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Figure 2: Bounds on normalized MAE and RMSE as a function of A'' 
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Both error measures are seen to have the same type of behaviour, with MAE lower 
than RMSE. 

It is interesting to compare the bound ^ with the normahzed MAE resuhing 
from a fixed sample size n. This is obtained from [9', eq. (1.1)] as 



n\p-p\] r.f n-l 



P \No-l 



with A^o = L^-PJ + 1- Since the average sample size in inverse binomial sampling is 
N/p, the comparison is restricted to probabilities p such that N/p is an integer value, 
and the sample size n in the fixed case is taken equal to this value. The resulting 
fixed-size normalized MAE is shown in Figure [T] with dashed lines. Dividing ([2]) by 
([T2|) with n — N/p, it is easily seen that, for p ^ 0, the MAE with inverse binomial 
sampling is asymptotically (1 + 1/(A^ — l))^^+^e times larger than the MAE with 
fixed size. This value is close to 1 except for very small values of N. This is observed 
in Figure [U which also shows that the MAE ratio is approximately maintained for 
all values of p. It is thus concluded that, in order to guarantee a given normalized 
MAE, inverse binomial sampling gives an average sample size that is only slightly 
larger than the sample size that would be necessary in the fixed case (which is a 
function of the unknown p) . 



4 Conclusion 

The MAE in inverse binomial sampling has been analyzed. It has been shown that 
the estimator guarantees a certain value of the normalized MAE. This result is 
analogous to that known for the MSE, and allows to select a value of N that meets 
a prescribed error level irrespective of p. 
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